Dimension Compatibility for Data Mart Integration

نویسندگان

  • Luca Cabibbo
  • Riccardo Torlone
چکیده

The problem of integrating autonomous data marts arises when, e.g., a large organization (or a federation thereof) needs to combine independently developed data warehouses. It turns out that this problem can be tackled in a systematic way because of two main reasons. First, data marts are usually structured in a rather uniform way, along dimensions and facts. Second, data quality in data marts is usually higher than in generic databases, since they are obtained by reconciling several data sources. Our scenario of reference is a federation of various data marts that we need to query in a unified way by means of drillacross operations. We propose a novel notion of dimension compatibility and characterize its general properties. We then show the significance of dimension compatibility in performing drill-across queries over autonomous data marts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inferring Aggregation Hierarchies for Integration of Data Marts

The problem of integrating heterogeneous data marts is an important problem in building enterprise data warehouses. Specially identifying compatible dimensions is crucial to successful integration. Existing notions of dimension compatibility rely on given and exact dimension hierarchy information being available. In this paper, we propose to infer aggregation hierarchies for dimensions from a d...

متن کامل

Semi-automatic Discovery of Mappings Between Heterogeneous Data Warehouse Dimensions

Data Warehousing is the main Business Intelligence instrument for the analysis of large amounts of data. It permits the extraction of relevant information for decision making processes inside organizations. Given the great diffusion of Data Warehouses, there is an increasing need to integrate information coming from independent Data Warehouses or from independently developed data marts in the s...

متن کامل

From Data Mart to Information Smart : Substation Automated Analysis Implementation

The paper discusses substation IED data integration and its importance for implementation of automated analysis solutions. Recorded data collected from various substation IEDs is stored into a substation data mart that utilizes standardized file formats and a database interface. The data mart provides a foundation for multiple uses of substation data. Utilities are faced with a challenge of how...

متن کامل

Data Mart Designing and Integration Approaches

Today companies need strategic information to counter fiercer competition, extend market share and improve profitability. So they need information system that is subject oriented, integrated, non volatile and time variant. Data warehouse is the viable solution. It is integrated repository of data gathered from many sources and used by the entire enterprise. In order to standardize data analysis...

متن کامل

Adapting Multidimensional Schemes to Data sources using Algebraic Operators

Designing a decisional system requires a methodology different from those commonly adopted for operational information systems. In our methodology data marts are constructed on the basis of user requirements specified using OLAP design patterns. Since these patterns are independent of any data source, the data mart design process should solve the problems due to differences between user OLAP re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004